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. Abstract The main purpose of this paper is to give a solution to a conjecture 

pLn ' concerning a Pade family of iterations for the matrix sector function that was 

Q> . recently raised by B. Laszkiewicz et al in [A Pade family of iterations for the ma- 

trix sector function and the matrix pth root, Numer. Linear Algebra Appl. 2009; 
16:951-970]. Using a sharpened version Schwarz's lemma, we also demonstrate a 
strengthening of the conjecture. 
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1. Introduction 

Let p > 2 be an integer. The matrix sector function was introduced in [19] as a generalization 



> 

0^ . of the matrix sign function. The matrix sector function of A can be defined as 

cn 

(N : sectpiA) = A{AP)-'^/P 

O 

where (^4^)^/^ is the principal pth root (see ^) of the matrix A^. For p = 2 the matrix sector 
function is the matrix sign function [13]. In his 2008 book [9] Nicholas Higham remarked on 
^ . page 49 that "a good numerical method for computing the matrix sector function is currently 

■ lacking". 



Let k and m be non- negative integers. A Pade approximant [k/m] to the complex scalar func- 
tion f{z) is a rational function of the form Pkmiz) / Q km{z) , where Pkm and Qkm are polynomials 
of degree less than or equal to k and m, respectively, Qkm{0) = 1 and 

fiz) - = 0(^^+"^+^) as z ^ 0. 

For given k and m if the Pade approximant [k/m] exists, then it is unique. It is usually required 
that Pkm and Qkm have no common zeros, so that Pkm and Qkm are unique (see [9l p.79]). 
Detailed account of the theory of Pade approximation can be found in [3| • 
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Let now Pkm{z) /Qkm{z) be the Pade approximant [k/m] of the function (1 — z)"^/^. Since 
(1 — z)""*^/^ = 2-^i(l/p, 1; 1; z), where 2F1 is the Gauss hyper geometric function, we may apply 
the formulas for the Pade approximants to this function from [21 p. 65] or [20] yielding 

™ (-m)j(-i . 1 
Qkm{z) = 2^ — ■](_!, ^ \ = 2Fi{-m, k;-k- m; z), 

^{-k)j{l-m)j . 1 

Pkmiz) = y ^ .., , r — z^ = 2Fi{-k, -m + -;-k- m]z), (1.1) 

^ jl{-k-m)j p 

where we assume k > m — 1 > are integers and {a)k = a • (a + 1) • • • (a + /c — 1), (a)o = 1. We 
will rederive this formulas in Lemma |2 . 1 1 below simultaneously finding the approximation error. 

The scalar Pade iteration for the function where ^ denotes the principal pth root, 

corresponding to the Pade approximant [k/m], has the form 

xi+i = hkm{xi) := XI J"""^. , xq = A. (1.2) 

The swap between the scalar p-sector function and the pth root holds also in matrix settings 
and therefore from the Pade family for the matrix sector function, one obtains a family of 
iterations for the matrix pth. root ^6; formula (36)] 

Xi+, = XiPkmil - A-^Xf)Qkm{I - A~^Xf)-\ Xo = I. 
Computation of matrix pth root has aroused considerable interest recently, see for example 

[a El El dm nj. 

The following is a sample of iteration functions hj^-mix) from the scalar Pade family (|1.2p : 

h px 
hoo = X, hoi 



xP + {p- 1) 



hio = -[-xP + [1 + p)\, /ill = X 



02 



h 



12 



p {p + l)xP + {p — 1) 

2p^x 

{-p + l)x2p + (4p - 2)xP + (2p2 _ 3p + 1) 

2px[{2p - l)xP + {p + I)] 
{p + l)x2p + (V + 2p- 2)xP + (2p2 - 3p + 1) 

^[{p + l)x2p - (4p + 2)xP + (2p2 + 3p + 1)] 
2p^ 

X {-p + l)x2P + xP(4p2 _ 2p - 2) + (2p2 + 3p + 1) 
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2p {2p + l)xP + (p - 1) 

x[(2p2 - 3p + l)x2p + (8p2 - 2)xP + (2/ + 3p + 1)] 
(2p2 + 3p + l)x2p + (8p2 - 2)xP + (2p2 - 3p + 1) 



The Pade approximant [1/1] provides the Halley method for the sector function that was 
considered in [14^ I15j. B. Laszkiewicz et al [16] have proved the following relation for hoi. 



Theorem 1.1. Let xq = A lie in the region L^'^'^^^ := {z £ C :\ 1 — |< 1} and let the sequence 
{xi} be generated Hqi. Then 

|l-xf| < \l-xPf\ 

They also posed the following conjecture [TBI Conjecture 4.2]. 

Conjecture 1.2. Let xi be the sequence generated by for k > m — 1. If xo lies in the 

region Ljf°'^^\ then 

< (1.3) 
The main purpose of this paper is to demonstrate the validity of this conjecture. 

2. Main Results 

We start with some useful lemmas. 

Lemma 2.1. The following relation holds true: 

Pkm{t) _ klm\C-)k+i{l - ^,^^^, R,^{t) ^ ^2^^^ 



Qkmit) ' ' (fc + m)!(/c + m + l)! QfcmW 



where Rkm = 2F1 



m + l,A: + i + l 
m + k + 2 



t 



Proof. In order to find the relation between Pkm{t) and Qkmit) we want to apply Euler's trans- 
formation which reads [H formula (2.2.7)] 

2Fi{a,h,c;x) = (1 - xY~''~^2Fi{c - a,c - b,c]x). 

However, since Pkmit), Qkmit) are polynomials, we cannot use this formula directly. To find the 
right modification, choose any 6 £ (0, 1) so that 

1 _i 1 

2Fi{—k — 6, m; —k — m — 6;t) = [1 — t) P2Fi(—m, k — S; —k — m — 5; t). 

P P 

_i 

Take limit (5 — >■ on both sides. On the right hand side, we immediately get (1 — t) pQkm{t)- 
On the left we have 

2F,{-k-6,l-m;-k-m-6;t) = l + —j^^^^t + 

(-fc-<5)(-fc-<5+l)-(-^)(l-m)-(l-m+fc) 
■ ■ ■ + (-k-m-6){-k-m-5+l)-(-m-5)(k+iy. ^ 
(-fc-g)(-fc_^+l)...(-5)(-3+l)-(-3+m)(l-m)-(l+fc) 
+ {-k-m-6){-k-m-5+l)-(-5)(k+m+l)\ * ' ' ' • 

Taking limits 5 — t- we get 

(1 - t)-pQUt) = Pkmit) + (-1)"- '^+^+\ k+m^l + . . . 

(fe + m + Ij! 
(fc + m + 1)! \ rri + k + 2 



By using — m) 



(-l)™(^)fc+i(l - we obtain the identity 



□ 



According to Euler's integral representation of the Gauss hypergeometric function The- 
orem 2.2.1] and Euler's reflection formula T{z)T{l — z) = tt/ sin(7rz) [T| Theorem 1.2.1] we 
have 

1 1 1 



(1-t) P =2Fi(l/p,l;l;t) 



sin(7r/j)) 



vr 



-"^ UP "^(1 — n) 
l-tu 



-du. 



(2.2) 



Lemma 2.2. All the Taylor coefficients of are positive for k > m — 1 > 0. 



Proof. First we note that representation (|2.2p implies that the function (1 — t)~p, which is 
defined in the whole complex plane except a cut [l,oo) on the positive real axis, is a Stieltjes 
function [3, formula (5.1.1)]. Since we have required k > m — 1, then according to [3, Theorem 
5.2.1] 
have 



has simple positive poles lying in (l,oo) with positive residues. Summing up, we 



Pkmjt) 



.7 = 1 ^ 



with \j > 0, < aj < 1 and R{t) is a polynomial of degree k — m (or zero if A; = m — 1). This 
formula makes it clear that all power series coefficients of ^""^j^l are positive possibly except the 
the first k — m + 1 influenced by the polynomial R{t). But those are also positive as we have 
proved in Lemma l2.1i □ 

We next present an identity which might be of interest in its own right. 

Theorem 2.3. The following identity holds true: 



(t(a + 6-l)-c+l)2-Fi 



a, b 



t 2F1 



1 - a,l 

2 - c 



t(l-0(l-Q)(l-b) 
H 



2 - c 
t{l - t)ab 



a, b 
c 



a + 1,6 + 1 
c+1 



1 

t 1 2F1 ^ 
t] 2F1 



2 - a,2 - 6 

3 - c 

/ l-a,l- 
\ 2-c 



t] = 1 - c. (2.3) 



Proof. Write 



a, b 
c 



t , G{t) = 2F1 



1 - a, 1 
2-c 



F{t) = 2F1 

Then identity (|2.3p takes the form 

$(t) := (t(a + 6 - 1) - c + l)FG + t{l - t)[FG' - OF'] = 1 - c. 
Since F{0) = G(0) = 1 all we need to prove is ^'{t) = 0. Differentiation yields: 
$'(t) = (a + b-l)FG+{2-c-{3-a-b)t)FG'-{c-{a + b+l)t)F'G+t{l-t)FG"-t{l-t)GF". 
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The hypergeometric differential equation reads: 

t{l - t)F" + (c - (a + 6 + l)t)F' -abF = 

for F and 

t(l - t)G" + (2-c-(3-a - b)t)G' - (1 - a)(l - b)G = 
for G. Simple rearrangement of the expression for ^'{t) then gives: 

= -G[t{l - t)F" + (c - (a + 6 + l)t)F' ~ abF] - abFG 
+ F[t{l - t)G" + (2-c-(3-a - b)t)G' - (1 - a)(l - b)G] + (1 - a)(l - b)FG + {a + b- l)FG 

= FG{-ab + (1 - a)(l - 6) + (a + 6 - 1)) = 0. 

□ 

Remark. Identity (|2.3p is related to several well-known formulas for hypergeometric functions 
such as Legendre's identity, Elliott's identity and Anderson- Vamanamurthy-Vuorinen's identity. 
See [1] for details. 

Now taking a = — m, b = —k — i, c = —k — m in the identity (12.30 we get 
+ m + 1)(1 - t) - t)Qfe„(t)i?fc™(t) +Pt(l - - - 

= p{k + m + 1). 

Rewriting formula (12. ip from Lemma 2.1 in the form 

= (1 - - (r'iul l iTl. t'^-'^'RUt) (2.4) 

(fc -|- m)\[K + m + Ij! 

and substituting (|2.4|) for Pkmi't) we obtain after some simple algebra 



fc!m!(i),+i(l-i). 



{k + m)\{k + m + l)\ 

{{p{k + m + 1)(1 - t) - t)Qkm{t)RkUt) - i)Qkm{t)R'k^{t) - pt{l - t)Q'^^{t)Rk^{t)] 



pk\m\{^k+i{l-l)m 



[{k + m] 
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This leads to the following statement. 
Theorem 2.4. Suppose A; > m — 1 > and let 



fUt) = 1 - (1 - t) i^^Y ■■= T.cun.,t\ (2.5) 



Qkmip) J 

T/ien 

Cfcm.o = • • • = Cfcm,fc+m = 0, Ckm,i > for i > k + m + 1. 



Proof. It is easy to see fkm{0) = 0, so the constant term Ckm,o = 0. The desired conclusion 
fohows from the observation that 

/p. (t)\P~^ 1 

f'kmit) = • ,n ■ [Pkmmkmit) + p{t - 1) (P^lOQ^rn (t) - Pfcm(t)Qi.^(t))] 

\Qkm{t)J yQkm\t)) 



Qkm{t)J (QkmitW P + m)!]2 

by the formula preceding this theorem. The expression on the right has positive Taylor coeffi- 
cients starting from the term t^+"^ by Lemma |2.2[ □ 

Remark. The case [1/1] has been proved in |17] using a different approach. 
Now we are in the position to give a proof to Conjecture 11.21 

Proof. Let xq G Lp^"''^^^ and let t = 1 — Xq. Then \t\ < 1. From p.2p we obtain 

Since fkm{t) has non-negative Taylor coefficients by Theorem 12.31 it is clear that 

max|/fcm(t)| = fkmi'^) = 1. 
1*1=1 

Then by Schwarz's lemma [31 Theorem 5.4.3] for all xq G L^jf"''^^^ 

|l-x?| = |A™(l-xg)|<|l-xg|^+-+i 

so that by an induction argument we see that l\1.3\i holds. This completes the proof. □ 

In the next theorem we show that the speed of convergence is in fact even higher then con- 
jectured in |16) and proved above. Hence we have a strengthening of the Conjecture 11.21 

Theorem 2.5. Let xi be the sequence generated by for k > m — 1. If xq ^ Lp^"''^'^\ then 
|l-.f|<|l-.SI<— '(i^^) , (2.7) 



where 



pk\m]a)k+i{l-l)m ^ 

a = — < 1. 

(A; + m)l{k + m + 1)1 
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Proof. Follow the proof of Conjecture 11.21 up to application of Schwarz's lemma. Let a be the 
first non-zero coefficient of fkm- Then apply [iSl Lemma 2] yielding {ti = 1 — x^) 

I . I If (. \\ ^ \+ \k+m+l + " ^ (u \k+m+l \^l-2\ +« ^^ ^+'"+^ \ti_i\ +a 

\ti\ = \fUti-,)\<\u.,\ TT^T^ < TTW^\) i + a\t,.,\ 



|to|+a \ (^+-+1)'-^ ( M+a \ ( \U-^\+a 
l + a\to\) Vl + «l*il/ "'\l + a\ti^i\ 

(fe+m+l)'"^ + (fc+m+l)'"2H h(fc+m+l)+l 



< \to\ 



{k+m+iy 



\to\+a Y 
l + a\to\) 



where we have used the monotonicity of t — t- (t + q)/(1 + at) on (0, 1) in the ultimate inequality. 
Summing up the geometric progression in the exponent in the last line gives (|2.7p . The value of 
a is found from (12.61). □ 



Remark. The value of a is quite small even for moderate values of k and m. For instance, for 
the Halley method k = m = 1 we have a = {p + l){p — l)/(12p^). This shows that Theorem 
is a substantial improvement over Conjecture 11.21 as well as over Theorem II. 1[ 
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